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ABSTRACT: Implementing phonetics to Natural language is a Herculean task. The first step for the phonetic 
implementation is to generate a phonetic dictionary. Phonetic dictionary is very important as it plays a vital 
role in identifying the essential components of the gigantic vocabulary in the speech recognition system of 
natural language. Language like English needs phonetic transcriptions because the English spelling does not 
tell us how to pronounce it. Pronunciation is very important for communication as it vitalizes rapid transition to 
practical concern. There is lot of ongoing research in multilingual speech recognition. Phonetic dictionary is 
utmost important for any speech recognition system and hence the key goal of the paper is to build a phonetic 
dictionary for English language. The paper aims at building dictionary for identifying the phonetic 
transcriptions of International Phonetic Alphabet (IP A) and American English alphabets phonetically. The 
paper also gives a detailed explanation about the rules of the phonemes. 

KEYWORDS: Natural Language Processing (NLP), Phonetics, Acoustic phonetics, Phonemes, 
Phonetic transcriptions, WordNet. 

I. INTRODUCTION 

According to the WordNet the phonetics is defined as a branch of NLP which is concerned with the 
speech production and perception and a detailed analysis of acoustics. Acoustic phonetics is a branch of 
phonetics which deals with the study of sounds that is made by the vocal organs of the human in order to 
produce the sound. The acoustic knowledge is very important to locate the underlying phonetic representation. 
Acoustics discusses how the sound travels from speakers' mouth to listeners' ears. The challenges of identifying 
the language and storing the corresponding sound for phonetic transcription are tedious and tiresome. The 
International Phonetic Alphabet (IP A) and the acoustic modeling of various phonemes have shown rationally 
good improvement for the implementation and understanding of the sound structure. Various approaches such as 
Statistical Machine Translation (SMT), language identification system (LID) merges gap between the encoding 
and the decoding process by using multi stream approach [1]. 

Phonemes are the basic building block for the language which is combined with the other phoneme to 
build a meaningful unit such as words and morphemes. Phonemes are the individual or group of sound units. 
The phonetic codes include syllable code, syllable time and syllable rhythm. The phonemes form the bedrock 
for initial and final code form to represent the phonetic structure with sensibility and sub-information of 
linguistics in syllable code [2]. 

Phonetic transcriptions tell us how the word has to be pronounced. These phonetic transcriptions are 
written in IPA where each and every English alphabet has its own symbol. For example the IPA based phonetic 
transcription for the words such as no is nou, and the transcription of do is du:. Though both of these words end 
with the same letter they have different sound and the phonetic transcriptions are different. The transcription for 

the word alphabet is[*^ P ^1 Phonetic transcriptions are helpful because there are no convincing 
empirical results which help us to sort the pronunciation training [5]. The below table mention some of the 
examples for phonetic transcriptions. 

/ ek.spek'tei.fn/ expectation 
main stress 1 

secondary stress / ri: ' tell/ retell 

syllable division /' sis.tam/ system 

Table- 1 phonetic transcriptions 
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II. LITERATURE SURVEY 

It is said that there are around 4000 languages spoken today across the world and most of these 
languages have limited linguistic knowledge and speech data/resources are available. Author Akbacak, M et al 
have proposed series of speech retrieval algorithms in order to leverage the already existing algorithms. The 
algorithms employ confusion -embedded hybrid pronunciation networks, and lattice-based phonetic search 
within a proper name retrieval task. Latin -American Spanish are used as the target language. After searching for 
queries consisting of Spanish proper names in Spanish Broadcast News data, we demonstrate that retrieval 
performance degradations (due to data sparseness during automatic speech recognition (ASR) deployment in the 
target language) are compensated by employing English acoustic models [9]. Speech recognition system for 
various languages have been studied and developed. Arabic speech recognition system is developed at 
Cambridge University by Gales MJ F et al [6]. Author has shown a simple scheme for automatically generating 
associations among diverse pronunciations for use in training and reducing the phonetic out-of-vocabulary rate. 
Author Palanisamy K et al, has developed Tamil pronunciation dictionary by incorporating the visual actions of 
the organs for improving communication skills [7]. Phonetic transcriptions by using Speech Assessment Method 
Phonetic Alphabet (SAMP A) for Romanian language has been proposed by Domokos, J et al. Development of 
phonetic dictionary and also the system architecture is explained in the paper [8]. 

In the next section various methodologies for building the phonetic dictionary and the associated rules 
are discussed. 

III. METHODOLOGIES 

Today almost all the English dictionaries have audio recordings. So even with this, the need of 
phonetic transcriptions is prominent because of the following reasons [3] 

> To have a good communication in English first we have learn the pronunciations and English language 
is polysemy in nature. So learning the pronunciations through sound is apt in these kinds of situations. 

> When we are listening to the pronunciations we might not be sure whether we heard 
u or a, t> or a, s or z, etc., this is due to the lack of experience and the quality of the particular sound is poor. 
Hence reading phonetic transcriptions will make the phonemes clear as we will see the sound symbols of all the 
alphabets in a word 

> Dictionaries often show multiple transcriptions but have the same pronunciations so the better way to 
find the reasons for this ambiguity would be to study the information about the particular transcriptions. 

> The computer which we are using might not have the speakers and hence the audio recording of the 
particular sound cannot be heard. Sometimes even if the speakers are available we do not want to disturb the 
public with the sound. In these situations transcriptions will greatly help. 

Word Stress: A word is combination of syllables. In every word, one or more letters are pronounced 
strongly. This is called word stress. The dictionary which is shown in this paper also contains the word stress. 
Pronunciation for English language is broadly classified into American and British transcriptions. In American 
English, the sound of r is always stressed where as in British English r is not stressed. Example: arm/ father are 
pronounced differently in both of the accent. 

IPA: International Phonetic Alphabet is the abbreviation for IPA. It defines the standard phonetic 
symbol for every alphabet in English language. The IPA symbols are usually written in the Latin symbols. IPA 
defines the standard sound representation for oral language. IPA is considered as the standard for linguistics. 
However there is also the American phonetic alphabet because of the difficulties faced with IPA such as, it is 
tough to type the IPA symbols with the normal keyboard. IPA also increases the error rate by withholding the 
awkwardness reading the transcriptions which are hand written. 

IV. BUILDING PHONETIC DICTIONARY 

Phonetic dictionary is not as simple as it looks because applying sounding mechanisms to make 
computer auto detect the word and give the pronunciation is difficult. There are various rules that need to be 
followed while building the phonetic transcriptions. The rules are [3] 

[1]. Almost all dictionaries use the e symbol for the vowel in bed. The problem with this convention is 
that e in the IPA does not stand for the vowel in bed; it stands for a different vowel that is heard, for 
example, in the German word Seele. The "proper" symbol for the bed vowel is £ (do not confuse with 3:). 
The same goes for ea vs. £3. 
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[2]. Iii3 r and3: r , the ' is not pronounced in BrE, unless the sound comes before a vowel (as in 
answering, answer it). In AmE, the r is always pronounced, and the sounds are sometimes written 
as &■ and 3-. 

[3]. In AmE, a: and r> are one vowel, so calm and cot have the same vowel. In American transcriptions, hot is 
written as ha:t. 

[4]. About 40% of Americans pronounce o: the same way as a:, so that caught and cot have the same vowel. 
See cot-caught merger. 

[5]. In American transcriptions, o: is often written as r>: (e.g. law= Id:), unless it is followed byr, in which 
case it remains an o:. 

[6]. In British transcriptions, ou is usually represented as au. For some BrE speakers, ou is more appropriate 
(they use a rounded vowel) — for others, the proper symbol is au. For American speakers, ou is usually 
more accurate. 

[7]. In ea r ra r ua r , the r is not pronounced in BrE, unless the sound comes before a vowel (as in dearest, dear 
Ann). In AmE, the r is always pronounced, and the sounds are often written as er ir ur. 

[8]. All dictionaries use the r symbol for the first sound in red. The problem with this convention is that r in 
the IPA does not stand for the British or American r; it stands for the "hard" r that is heard, for example, 
in the Spanish word rey or Italian vero. The "proper" symbol for the red consonant is j. 

[9]. In American English, t is often pronounced as a flap t, which sounds like d or (more accurately) like the 
quick, hard r heard e.g. in the Spanish word pero. For example: letter. Some dictionaries use 
the t, symbol for the flap t. 

Table II describes the English alphabets and the corresponding IPA and American phonetic alphabets. Word 
stress is also highlighted. 

Table II IPA and phonetic Transcriptions 



English Alphabet 


a 


b 


c 


d 


e 


f 


g 


h 


i 


J 


k 


1 


M 


n 


o 


IP A/American Phonetic Alphabet 




bi 


si 


di 


I 


ef 


Ji/d3i 


etj/ec 


aj/ay 


d3e/je 


ke 


el 


em 


en 


o 


Stress Marked across the Alphabets 


e 


bi 


si 


di 


i 


ef 


ji/d3i 


etf/ec 


aj/ay 


d3e/|e 


ke 


el 


em 


en 
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English Alphabet 


P 


Q 


r 


s 


t 


u 


V 


W 


X 


Y 


z 


IP A/American Phonetic Alphabet 


Pi 


kyu/kju 


ar 


es 


ti 


ju/yu 


vi 


dsbslju/dsbslyu 


eks 


waj/way 


zi 


Stress Marked across the Alphabets 


Pi 


kyii/kju 


ar 


es 


ti 


ju/yu 


vl 


dsbslju/dsbslyu 


eks 


waj/way 


zl 



Table III shows the IPA phonetic symbols and its phonetic description. The short cut key defines the letters that 
we have to type in order to get the appropriate phonetic symbol. 



Alphabet 

s 


Phoneti 

c 

symbol 


Phonetic 
description 
for the 
symbol 


Short cut key 




a 


open back 

unrounded 

vowel 


(Ctrl+A) 




ae 


near-open 
front 

unrounded 
vowel 


(Ctrl+AA) 




B 


near-open 
central vowel 


(Ctrl+AAA) 


A 


a 


nasalized open 
back 

unrounded 
vowel 


(Ctrl+AAAA) 




P 


voiced bilabial 
fricative 


(Ctrl+B) 




b 


voiced bilabial 
implosive 


(Ctrl+BB) 


B 


B 


bilabial trill 


(Ctrl+BBB) 



Alphabet 

s 


Phoneti 

c 

symbol 


Phonetic 
description 
for the 
symbol 


Short cut key 


M 


m 


labiodental 
nasal 


(Ctrl+M) 






velar nasal 


(Ctrl+N) 






palatal nasal 


(Ctrl+NN) 




N 


uvular nasal 


(Ctrl+NNN) 


N 


n. 


retroflex 
nasal 


(Ctrl+NNNN) 




3 


open-mid 
back rounded 
vowel 


(Ctrl+O) 


0 


ce 


open-mid 
front rounded 
vowel 


(Ctrl+OO) 
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voiceless 






e 


alveo-palatal 
fricative 


(Ctrl+C) 






voiceless 




c 


S 


palatal 
fricative 


(Ctrl+CC) 






voiced dental 






9 


fricative 


(Ctrl+D) 






voiced 






d3 


postalveolar 
affricate 


(Ctrl+DD) 






voiced 








retroflex 






4 


plosive 


(Ctrl+DDD) 






voiced 








alveolar 




D 


, /■ 

Q 


implosive 


(Ctrl+DDDD) 






mid-central 






3 


vowel 


(Ctrl+E) 






rhotacized 








mid-central 






a- 


vowel 


(Ctrl+EE) 






close-mid 








central 








rounded 






e 


vowel 


(Ctrl+EEE) 






close-mid 








central 








unrounded 




E 


3 


vowel 


(Ctrl+EEEE) 






open-mid 
front 








unrounded 






e 


vowel 


(Ctrl+3) 






open-mid 
central 








unrounded 






3 


vowel 


(Ctrl+33) 






rhotacized 








open-mid 
central 








unrounded 






3- 


vowel 


(Ctrl+333) 






nasalized 








open-mid 
front 








unrounded 






e 


vowel 


(Ctrl+3333) 






open-mid 
central 








rounded 




F 


e 


vowel 


(Ctrl+33333) 






voiced velar 






cf 


implosive 


(Ctrl+G) 






voiced uvular 






G 


plosive 


(Ctrl+GG) 






voiced uvular 




G 


tf 


implosive 


(Ctrl+GGG) 





0 


close-mid 
front rounded 
vowel 


(Ctrl+OOO) 




T> 


open back 
rounded 

vowel 


(Ctrl+OOOO) 




5 


nasalized 
open-mid 
back rounded 
vowel 


(Ctrl+OOOOO) 




CE 


open front 
rounded 

vowel 


(Ctrl+OOOOOO 

) 


P 




voiceless 

bilabial 

fricative 


(Ctrl+P) 




r 


alveolar tap 


(Ctrl+R) 




K 


voiced uvular 
fricative 


(Ctrl+RR) 




1 


alveolar 
approximant 


(Ctrl+RRR) 




I 


retroflex 
approximant 


(Ctrl+RRRR) 




R 


uvular trill 


(Ctrl+RRRRR) 




r. 


retroflex flap 


(Ctrl+RRRRRR) 


R 


i 


alveolar 
lateral flap 


(Ctrl+RRRRRR 
R) 




S 


voiceless 

postalveolar 

fricative 


(Ctrl+S) 


S 


§ 


voiceless 
retroflex 
fricative 


(Ctrl+SS) 




© 


voiceless 

dental 

fricative 


(Ctrl+T) 




? 


voiceless 

postalveolar 

affricate 


(Ctrl+TT) 




5 


voiceless 

alveolar 

affricate 


(Ctrl+TTT) 


T 


t 


voiceless 
retroflex 
plosive 


(Ctrl+TTTT) 
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labial-palatal 
approximant 


(Ctrl+H) 




fi 


voiced glottal 
fricative 


(Ctrl+HH) 




h 


voiceless 

pharyngeal 

fricative 


(Ctrl+HHH) 




8 


Sje-sound 


(Ctrl+HHHH) 


H 


H 


voiceless 
epiglottal 
fricative 


(Ctrl+HHHH 
H) 




I 


near-close 
near-front 
unrounded 
vowel 


(Ctrl+I) 




I 


near-close 
central 
unrounded 
vowel 


(Ctrl+II) 


I 


i 


close central 

unrounded 

vowel 


(Ctrl+m) 




i 


voiced palatal 
fricative 


(Ctrl+J) 




J 


voiced palatal 
plosive 


(Ctrl+JJ) 


J 


J 


voiced palatal 
implosive 


(Ctrl+JJJ) 




i 


velarized 

alveolar 

lateral 

approximant 


(Ctrl+L) 




i 


voiceless 
alveolar 
lateral 
fricative 


(Ctrl+LL) 




L 


velar lateral 
approximant 


/ '/"V— 1 i T T T \ 

(Ctrl+LLL) 




L 


retroflex 
lateral 

approximant 


(Ctrl+LLLL) 


L 


t$ 


voiced 
alveolar 
lateral 
fricative 


(Ctrl+LLLLL) 



Table III Phonemes 





u 


near-close 
near-back 
rounded 
vowel 


(Ctrl+U) 




0 


near-close 
central 
rounded 
vowel 


(Ctrl+UU) 


u 


H 


close central 

rounded 

vowel 


(Ctrl+UUU) 




A 


open-mid 
back 

unrounded 

vowel 


(Ctrl+V) 




n 


labiodental 
approximant 


(Ctrl+VV) 


V 


V 


labiodental 
flap 


(Ctrl+VVV) 




M 


voiceless 

labio-velar 

approximant 


(Ctrl+W) 




UI 


close back 

unrounded 

vowel 


(Ctrl+WW) 


w 


m 


velar 

approximant 


(Ctrl+WWW) 


X 


X 


voiceless 

uvular 

fricative 


(Ctrl+X) 




A 


palatal lateral 
approximant 


(Ctrl+Y) 




Y 


voiced velar 
fricative 


(Ctrl+YY) 




Y 


near-close 
near-front 
rounded 

vowel 


(Ctrl+YYY) 


Y 


T 


close-mid 
back 

unrounded 

vowel 


(Ctrl+YYYY) 




3 


voiced 

postalveolar 

fricative 


(Ctrl+Z) 




\ 


voiced 

retroflex 

fricative 


(Ctrl+ZZ) 


Z 




voiced 
alveolo- 
palatal 
fricative 


(Ctrl+ZZZ) 
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V. CONCLUSION 

Pronunciation is very important in today's communication and currently there has been a shift from 
linguistic competencies to a broader level of communicative compliances. Effective communication is always 
based on the good pronunciation. Pronunciation is reckoned as not just production of the right phonemes. It 
forms the foundation for the next level of speech analysis. With adequate pronunciation skill one can fly to a 
new horizon by achieving professional responsibility. But with the increasing ambiguities in the natural 
language it is difficult to judge the right pronunciation. The goal of this paper was to provide a phonetic 
dictionary. This forms the root for the later part of the research work which is to build the interface for 
recognizing phonetic structure of sounds generated by the natural language mostly in English. The dictionary 
shown in the paper is for Latin -American English alphabets only. It is intentionally kept for these languages in 
order to limit the resources. The same rules will form the ground work to recognize the sound of other native 
languages such as Kannada, Telugu and others. Various researchers can get benefit from the paper by looking 
into the rules and the dictionary for building the sound recognition system. Future scope of the work will be on 
developing a tool for phonetic dictionary by applying the rules specified in the paper. 
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